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Receipt is acknowledged of this nonprovisional Patent Application. It will be considered in its order and you 
will be notified as to the results of the examination. Be sure to provide the U.S. APPLICATION NUMBER, 
FILING DATE, NAME OF APPLICANT, and TITLE OF INVENTION when inquiring about this application. 
Fees transmitted by check or draft are subject to collection. Please verify the accuracy of the data presented 
on this receipt. If an error is noted on this Filing Receipt, please write to the Office of Initial Patent 
Examination's Customer Service Center. Please provide a copy of this Filing Receipt with the 
changes noted thereon. If you received a "Notice to File Missing Parts" for this application, please 
submit any corrections to this Filing Receipt with your reply to the Notice. When the PTO processes 
the reply to the Notice, the PTO will generate another Filing Receipt incorporating the requested 
corrections (if appropriate). 
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Projected Publication Date: N/A 
Non-Publication Request: No 



Early Publication Request: No 
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GRANTED 



The applicant has been granted a license under 35 U.S.C. 184, if the phrase "IF REQUIRED, FOREIGN 
FILING LICENSE GRANTED" followed by a date appears on this form. Such licenses are issued in all 
applications where the conditions for issuance of a license have been met, regardless of whether or not a 
license may be required as set forth in 37 CRF 5.15. The scope and limitations of this license are set forth in 
37 CFR 5.15(a) unless an earlier license has been issued under 37 CFR 5.15(b). The license is subject to 
revocation upon written notification. The date indicated is the effective date of the license, unless an earlier 
license of similar scope has been granted under 37 CFR 5.13 or 5.14. 

This license is to be retained by the licensee and may be used at any time on or after the effective date 
thereof unless it is revoked. This license is automatically transferred to any related applications(s) filed under 

36 CFR 1 .53(d). This license is not retroactive. 

The grant of a license does not in any way lessen the responsibility of a licensee for the security of the subject 
matter as imposed by any Government contract or the provisions of existing laws relating to espionage and 
the national security or the export of technical data. Licensees should apprise themselves of current 
regulations especially with respect to certain countries, of other agencies, particularly the Office of Defense 
Trade Controls, Department of State (with respect to Arms, Munitions and Implements of War (22 CFR 121- 
128)); the Office of Export Administration, Department of Commerce (15 CFR 370.10 (j)); the Office of 
Foreign Assets Control, Department of Treasury (31 CFR Parts 500+) and the Department of Energy. 

NOT GRANTED 

No license under 35 U.S.C. 184 has been granted at this time, if the phrase "IF REQUIRED, FOREIGN 
FILING LICENSE GRANTED" DOES NOT appear on this form. Applicant may still petition for a license under 

37 CFR 5.12, if a license is desired before the expiration of 6 months from the filing date of the application. If 
6 months has lapsed from the filing date of this application and the licensee has not received any indication of 
a secrecy order under 35 U.S.C. 181, the licensee may foreign file the application pursuant to 37 CFR 5.15 



PLEASE NOTE the following information about the Filing Receipt: 

• The articles such as "a," "an" and "the" are not included as the first words in the title of an application. 
They are considered to be unnecessary to the understanding of the title. 

• The words "new," "improved," "improvements in" or "relating to" are not included as first words in 
the title of an application because a patent application, by nature, is a new idea or improvement. 

• The title may be truncated if it consists of more than 600 characters (letters and spaces combined). 

• The docket number allows a maximum of 25 characters. 

• If your application was submitted under 37 CFR 1.10, your filing date should be the "date in" found on 
the Express Mail label. If there is a discrepancy, you should submit a request for a corrected Filing 
Receipt along with a copy of the Express Mail label showing the "date in." 

• The title is recorded in sentence case. 
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Assistant Commissioner for Patents 
Office of Initial Patent Examination 
Customer Service Center 
Washington, DC 20231 



A Scalable System for Clustering of Large Databases Having Mixed Data Attributes 

Cross Reference to Related Applications 

The present invention is a continuation in part of co-pending United States 
patent application serial number 09/083,906 entitled "A Scalable System for 
Expectation Maximization Clustering of Large Databases" to Fayyad et al having a 
filing date of May 22, 1998, and which is assigned to the assignee of the present 
application and is incorporated herein by reference. 

Priority is claimed from provisional United States patent anolication serial 
number 60/086,410 filed May 22, 1998. MbUtl VED 

SEP 0 6 2001 

Field of the Invention Technology Center 21 00 

The present invention concerns database analysis and more particularly 
concerns an apparatus and method for clustering of data into groups that capture 
important regularities and characteristics of the data. 

Background Art 

Large data sets are now commonly used in most business organizations. In 
fact, so much data has been gathered that asking even a simple question about the data 
has become a challenge. The modern information revolution is creating huge data 
stores which, instead of offering increased productivity and new opportunities, can 
overwhelm the users with a flood of information. Tapping into large databases for 
even simple browsing can result in a return of irrelevant and unimportant facts. Even 
people who do not 'own' large databases face the overload problem when accessing 
databases on the Internet. A large challenge now facing the database community is 
how to sift through these databases to find useful information. 

Existing database management systems (DBMS) perform the steps of reliably 
storing data and retrieving the data using a data access language, typically SQL. One 
major use of database technology is to help individuals and organizations make 
decisions and generate reports based on the data contained in the database. 

An important class of problems in the areas of decision support and reporting 
are clustering (segmentation) problems where one is interested in finding groupings 
(clusters) in the data. Data clustering has been used in statistics, pattern recognition, 
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